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Abstract — The Marton-Gelfand-Pinsker inner bound on the 
capacity region of broadcast channels was extended by Han- 
Costa to include arbitrarily correlated sources where the capacity 
region is replaced by an admissible source region. The main ar- 
guments of Han-Costa are correct but unfortunately the authors 
overlooked an inequality in their derivation. The corrected region 
is presented and the absence of the omitted inequality is shown 
to sometimes admit sources that are not admissible. 

I. Introduction 

We borrow terminology from [1] with minor modifica- 
tions. Consider a two-receiver broadcast channel (BC), say u>, 
with correlated, or more precisely dependent, sources (S,T). 
Source ($, T) is said to be admissible for this BC if for any A, 
< A < 1, and for large enough n there is a code with length- 
en, codewords such that Pei < A and Pe 2 < A, where Pe\ and 
Pe 2 are the respective error probabilities for receivers 1 and 
2. The set of all admissible sources is called the admissible 
source region. 

Han and Costa developed a (purported) subset of the 
admissible source region for BCs with arbitrarily correlated 
sources in [1, Theorem 1 and Example 1]. We observe that the 
main arguments in [1] are valid but the authors unfortunately 
overlooked an inequality in one of the final steps of their 
proof. The corrected versions of [1, Theorem 1 and Example 
1] are presented in Sec. [TTJ On the other hand, since the 
admissible source region is not known in general, it is not a 
priori clear whether or not the Han-Costa source set is in fact 
a subset of the admissible source region after all. We rule out 
this possibility by giving two examples where the Han-Costa 
source set includes sources that are not admissible. 

II. Revised Theorem 1 and Example 1 in [1] 

The wording of the theorem and the example below are 
taken with minor modifications from [1]. 

Theorem 1 (revised from [1]): Suppose that a broadcast 
channel u> and a source (S, T) are given, and let K = 
f(S) = g(T) be the common variable in the sense of Gacs 
and Korner (and also Witsenhausen). If there exist auxiliary 
random variables W, U, V (with values in finite sets) that 
satisfy the Markov chain property 



and the inequalities 

H(S) < I(SWU;Yx) -I(T; WU\S) (2) 
H(T) < I(TWV; Y 2 ) — I(S\ WV\T) (3) 
H(ST) < min{I [KW-Yx) J (KW]Y 2 )} + I [SU-^KW) 
+ I(TV;Y 2 \KW) - I(SU;TV\KW) (4) 
H(ST) < I(SWU; Yi) + I(TWV; Y 2 ) - I(SU; TV\KW) 
-I{ST\KW). (5) 

then the source (S,T) is admissible for the channel to. Here, 
X is an input variable with values in the input alphabet X, 
and Yi , Y 2 are the output variables with values in the output 
alphabets yi,y 2 , respectively, induced by X via u>. 

Example 1 (revised from [1]): Consider sources with 5* = 
(S ,K),T = (T ,K), where Sq,T ,K are statistically inde- 
pendent, and where H(K) = R , H(S ) = R\, H(T ) = R 2 . 
If we choose WUV to be independent of ST, then the 
conditions of Theorem Q] reduce to the Markov chain property 



WUV -X- Y X Y 2 



(6) 



and the inequalities 



ST - WUV - X - Y X Y 2 



(1) 



Ro + Ri <J(W;Yi) (7) 
Ro + Ri < I{WV; Y 2 ) (8) 
R + Ri+R 2 < rmn{I{W)Y 1 ),I{W)Y 2 )} + I{U;Y x \W) 
+ I(V;Y 2 \W) -I(U;V\W) (9) 
2R Q + R 1 +R 2 < I{WU; Y ± ) + I(WV; Y 2 ) ~ I(U; V\W). 

(10) 

Remark 1: Inequalities (0 and ( TToT i are missing in [1]. Note 
that the revised Example 1 is a special case of a more general 
result that appeared in the Ph.D. thesis of Y. Liang in 2005 
(see [2, p. 89, Remark 10] and [3, Theorem 5]). Note also 
that the rate region doli-dToTi was shown to be equivalent to 
the Marton-Gelfand-Pinsker region in [4] (what we call the 
"Marton-Gelfand-Pinsker region" is given in [5, Theorem 1] 
and [6, p. 391, Problem 10(c)]). 



A. Solving the Case of the Missing Inequality 

In [1, p. 647], the authors derive the following valid 
inequalities, see Equations (3.34)-(3.37): 

H{S\KW) + H{T\KW) + H(K) 

<I(TVW;Y 2 ) + I(SU;Y 1 \KW)- P o-pi-p 2 
H{T\KW) + H(K) < I(TVW; Y 2 ) - Po - p 2 
H(S\KW) + H{T\KW) + H(K) 

< I(SUW; Yi) + I (TV; Y 2 \KW) -p -pi-p 2 
H(S\KW) + H(K) < I(SUW; Yi) - p - Pi- 

They next eliminate the variables po,pi, p 2 using the following 
inequalities in [1, p. 645], see Equations (3.5)-(3.8): 

po > I(ST;W\K) 
fH > I{T;U\SW) 
P2 > I{S;V\TW) 
P i+p 2 > I{SU; TV\W) - I(S; T\W). 

The oversight occurs in this elimination. Eliminating po, pi,p 2 
and removing redundant inequalities we obtain the bounds 
in [1, (3.38)-(3.41)]: 

H(S\KW) + H(T\KW) + H(K) + I(ST; W\K) 

<I{TVW;Y 2 )+I{SU;Y 1 \KW)-A (11) 
H(T\KW) + H(K) 

< I(TVW; Y 2 ) - I (ST; W\K) - I(S; V\TW) (12) 
H{T\KW) + H(S\KW) + H(K) + I (ST; W\K) 

< I{SUW;Y 1 )+I{TV;Y 2 \KW)~A (13) 
H(S\KW)+H(K) 

< I(SUW; Yi) - I(ST; W\K) - I(T; U\SW) (14) 

where A = I(SU; TV\W)-I(S; T\W), as well as the bound 

H(S\KW) + H(T\KW) + 2H(K) + 21 (ST; W\K) 
< I(SUW; Yi) + I(TVW; Y 2 ) — A. (15) 

It is the inequality (JT3J that was omitted in [1]. 

Continuing as in [1, p. 647], we obtain the revised Theo- 
rem 1 by using the equalities 

H(S\KW) + H(K) = H(S) - I(S; W\K) 
H(T\KW) + H(K) = H(T) - I(T; W\K) 
H(S\KW) + H(T\KW) + H(K) 

= H(ST) + I(S; T\K) - I(S; W\K) ~ I(T; W\K). 

B. Counterexample \II-B\ 

Since the set-up of Example 1 is a well-studied and impor- 
tant case, we explore the following question: If we remove the 
inequality (TlOb then is the resulting rate region (the Han-Costa 
region of [1, Example 1]) always achievable? We develop two 
counterexamples to show that this is not the case. The reader 
will notice that the counterexamples are closely related. We 
present them both for reasons that will become clear in Sec.HTTI 



As a first counterexample, consider the deterministic BC 



(Yi,Y 2 ) = 



(0,0), 


X 


= 


(1.0), 


X 


= 1 


(1.1), 


X 


= 2 


(2,1), 


X 


= 3 



(16) 



The capacity region of a deterministic BC is known to be the 
union over Markov chains W — X — Yj Y 2 of the non-negative 
rate triples (Rq, Ri, R 2 ) satisfying (see [6, p. 391]) 

Ro < mia(I(W; Y{) t I(W\ Y 2 )) (17) 
R + Ri<H(Y 1 ) (18) 
Ro + R 2 < H(Y 2 ) (19) 

Ro + Ri+R 2 < min (I(W; Y x ), I(W; Y 2 )) + H(Y{Y 2 \W). 

(20) 

We mimic the development of [7, Sec. IV]. Suppose we would 
like to achieve 

R Q + Ri+R 2 =H(Y 1 Y 2 ) (21) 

for the BC (Q~6]l. For example, we can achieve (Ro, Ri,R 2 ) = 
(0, 1, 1) and H(YiY 2 ) = 2 by choosing W to be a constant 
and X uniform. It is easy to check that for (l2TT i to be satisfied, 
one must have the double Markov relations 

W-Y x -Y 2 (22) 
W -Y 2 ~Y X (23) 

in the expression d20l i. 

Suppose next that we would like to achieve 

Ro + Ri+R 2 =2 (24) 

for the BC (fTST l. as in the example we just considered. 
Obviously, the input X must be uniform, and for this choice 
of X one can check that the joint distribution of (Yi,Y2) 
is indecomposable in the sense of [6, p. 350]. This further 
implies, by [6, p. 402], that 

W is independent of Y X Y 2 (25) 

and therefore, by ( fTTI i. that Ro = 0. One can further check 
that with uniform X, but without the bound the following 
rate-triple is permitted 

(Ro,R u R 2 ) = (1/2,1,1/2) (26) 

Thus, the bound (fTTI i is needed because the rate-triple (|26| | is 
not achievable. 

Finally, note that we are further suggesting that one replace 
(fTTI i with the bound (fT0]i where U = Y x and V = Y 2 , i.e, with 

2R + R X +R 2 < I(W; Y x ) + I(W; Y 2 ) + H(Y 1 Y 2 \W). 

(27) 

For example, if Rq + Ri + R 2 = 2 then from (|25T l and (f27j 
we see that we must have 

2R + R 1 +R 2 <H(Y 1 Y 2 ) = 2 (28) 

so that Rq = 0. Summarizing, we need to add the bound 
dT7t or the bound JTO) to the bounds (2.14)-(2.16) in [1]. The 
equivalence of adding either bound was proved for general 
broadcast channels in [4]. 



C. Counterexample \II-C\ 

Consider the Blackwell BC shown in Fig. Q] This channel 
is deterministic so the capacity region is given by (fT7T> - d20b 
where W — X — Y{Y 2 forms a Markov chain. We have the fol- 
lowing lemma that is closely related to Counterexample Hl-BI 
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Fig. 1. Blackwell Channel 



Lemma 1: If W — X — Y{Y 2 forms a Markov chain and 

min (J(W; F1), I(W; Y 2 )) + H{Y X Y 2 \W) = H(Y\Y 2 ) 

for the Blackwell channel in Fig. Q] then the random variables 
W and X are independent. 

Before we prove Lemma [1] we claim that the lemma 
provides a counterexample to our question posed above. In 
particular, the lemma implies that if R0+R1+R2 — > H(Y\Y 2 ), 
then we must have R Q ( < min {I(W; Y t ), I(W; Y 2 )) ) -> 0. 
Thus the triple R = I(Y 1 ;Y 2 ) - e, R ± = H{Y 1 \Y 2 ),R 2 = 
H (Y 2 \Yi), is not achievable for e small enough. However, this 
rate-triple is permitted by [1, Example 1]. 
Proof: (Lemma [TJ The equality 

min (I(W; Fi), I(W; Y 2 )) + H{Y X Y 2 \W) = H{Y 1 Y 2 ) 

implies that I{W]Y X ) = I{W;Y 2 ) = I(W;YiY 2 ). Using the 
observation that for a Blackwell channel X is a deterministic 
function of Y{Y 2 , we have the following equalities 

I(W;Y 1 ) = I(W;X) (29) 
I(W;Y 2 )=I(W;X). (30) 

From ( 1291 and the Markov relationship W — X — Y\ we see 
that I{W;X\Y{) = and therefore 

P{X = 0\Yi = 0, W = w) = P(X = 0\Yi = 0). (31) 

For the Blackwell Channel, ( IBTT l is equivalent to 

P(X = 0) 
T(X = 0) + P(X = 1) 

P{X = 0\W = w) 



Similarly starting from d30b and the Markov relationship 
W — X — Y 2 , we compute 

P(X = 2) = P(X = 2\W = w) 
P(X=1) P(X = l\W = w)' K ' 

From d32t and ( f33l > we deduce 

P(X = i\W = w) = P(X = i), for i = 0, 1, 2. 

This concludes the proof of Lemma Q] ■ 

III. Historical Remarks 

The revised Theorem 1 and Example IH-BI were developed 
by G. Kramer in the summer of 2005. His motivation was that 
S. Shamai pointed out to him that the potential improvement 
(|6]l-(fT0ll of the Marton-Gelfand-Pinsker region that appeared 
in the Ph.D. thesis of Y. Liang (see [2, p. 89, Remark 10] and 
[3, Theorem 5]) was superseded by the earlier results of Han- 
Costa [1]. Kramer communicated the revised Theorem 1 and 
Example III-BI to Shamai and Han in August 2005 via email 
but did not otherwise document the results. 

In 2008, Y.-H. Kim queried C. Nair about the validity of the 
results in Han-Costa [1]. Nair independently discovered and 
corrected the error of [1] in 2008 and developed Example lll-CI 
He forwarded a write-up of his results to A. El Gamal and 
M. Costa. Costa forwarded the write-up to Han, who then 
replied back with the earlier communication by Kramer. This 
eventually led to the current joint paper. 
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P(X = 0\W = w)+ P(X = 1\W = w) ' 
Thus we obtain 

P{X = 0) _ P(X = 0\W = w) 
P{X = 1) _ P(X = l\W = w)' 



